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A q-level alphabet is defined as a row vector space over a finite field with 
q elements. The letters of the alphabet are the rows of the vector space, each 
consisting of n symbols from the ground field. The weight of a letter is the 
number of nonzero symbols it contains. The minimum weight of the letters 
of the alphabet, excluding zero, is denoted by d. A relationship is established 
between the alphabet and a set of points S in a finite projective space. There 
is a many-one correspondence between the letters of the alphabet and the 
hyperplanes of the space. The weight of a letter is simply related to the 
incidence of the set S with the corresponding hyperplane. 

Two sets of points in a finite projective space are called equivalent if they 
are related by a collincation of the space. Two alphabets are called equivalent 
if there exists between them, as vector spaces, a weight-preserving semi- 
isomorphism. It is shown that these definitions mean the same thing and 
reduce to the usual definition when q = 2. 

An inequality is established between the dimension of the alphabet and 
the parameters d, q, n. This gives a lower bound for n in terms of the other 
parameters. It is shown that this bound cannot be achieved by alphabets 
with repeated columns. A method is given for constructing a class of alpha- 
bets which attain this bound. It is shown that for the case q = 2 these are 
the only alphabets (in the sense of equivalence) for which the bound is at- 
tained. 

I. INTRODUCTION 

A great deal of work has been done on error-correcting codes for the 
binary channel. In this paper we consider codes for a channel that can 
transmit more than two levels. Multiple-level transmission is practical 
if the channel is sufficiently quiet, as, for example, the submarine voice 
cable. It results in a substantial increase in bit rate and in added flex- 
ibility in choosing a code. One now has four parameters to adjust — the 
number of levels of transmission, the number of information symbols, 
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the number of redundant symbols, and the number of errors it is desir- 
able to detect and/or correct. Of course it cannot be decided without 
detailed analysis whether these advantages will more than compensate 
for the added complexity of the terminal equipment. 

In the binary case, systematic error-correcting codes have certain ad- 
vantages; 1 in particular, they are amenable to known mathematical tech- 
niques. It has been shown by Slepian that the words of a systematic 
code form a group under place-by-place addition mod 2. The natural 
generalization of a group code over the field (0,1 ) appears to be a vector 
space over a finite field of q elements. We call such vector spaces alpha- 
bets, and their individual elements are called letters. In the general case, 
a "code" becomes an "alphabet" and a word (unfortunately!) becomes 
a "letter." Each letter is a row of n symbols picked from the ground 
field ; the alphabet is a space of row vectors of length n. The q different 
symbols of the ground field correspond to q different transmission levels. 

Because only a restricted type of code is considered, some assumptions 
must be made about the nature of the channel and of the information 
being transmitted. These are as follows: 

(a) The number of transmission levels is a power of a prime number, 
since the number of elements in a finite field is a power of a prime. In 
practice this is not a severe restriction; between one and nine we have 
excluded only the number six. 

(b) The channel is "symmetric" in the sense that every symbol has 
the same chance of getting through correctly, and that the probability 
of one symbol being changed into another is the same for every pair of 
symbols. 

(c) All errors are equally bad. This might be the case, for example, if 
one were ordering merchandise from a mail order house by catalog num- 
ber only. 

With these assumptions the principles of error correction by a g-level 
alphabet are exactly the same as those described by Slepian for a group 
code (i.e., a two-level alphabet). For convenience, the pertinent results 
from Slepian's paper are summarized in the Appendix. The parameters 
of an alphabet, besides n and q are 

1. Its dimension as a vector space, denoted by k. The alphabet con- 
tains q k letters ; k is also the number of symbols in each letter which can 
be regarded as carrying information. The remaining n — k symbols are 
added for the purpose of error detection and/or correction. 

2. The minimum weight, d, of the letters of the alphabet other than 
(00 • • • 0). (The weight of a letter is the number of nonzero symbols it 
contains.) The quantity d is closely related to the error-correcting prop- 
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erties of the alphabet; if an alphabet is to be capable of correcting all 
occurrences of 1, 2, • • • , e errors in each letter it must have d = 2e + 1. 

The purpose of this paper is to investigate the properties of vector 
spaces over finite fields, particularly those properties which are related 
to the parameter d. The weight of a letter exists only in relation to a 
particular base of the vector space, which is an awkward situation in 
modern algebra. Hence our chief mathematical tool is not algebra but 
finite projective geometry. The connection between binary group codes 
and finite geometries was pointed out by Bose, 3 and is easily extended 
to the general case. 

We first establish several new definitions of equivalence between alpha- 
bets. (Two equivalent alphabets have the same error-correcting proper- 
ties.) A lower bound for n is found in terms of k, q and d. Clearly it is 
desirable to have n — k (the number of check symbols) as small as 
possible. It is shown that this lower bound can be attained, but only by 
a restricted class of alphabets. These alphabets are, on the whole, not 
practical for communication purposes unless the expected error rate is 
extremely high. However, the geometric methods used in the construc- 
tion of these alphabets can be applied to find useful alphabets for specific 
cases. The theorems derived for g-level alphabets apply equally well to 
the case q = 2 and contribute to the theory of binary group codes. 

II. NOTATION 

In this section we define the notation to be used in this paper and 
introduce Bose's theorem on the relation between alphabets and 
proj ective geomet ries. f 

Let F(q) be a finite field with q elements and characteristic p, and let 
F*{q) denote the nonzero elements of F(q). We consider a vector space 
of dimension n over F(q). Let G„(q) denote the "row space," i.e., that 
particular representation of the vector space consisting of all possible 
n-tuples of elements of F(q). For example, G-.(-i) consists of the 2-tuples 

(00) (10) (01) (11) (lw) (lw 2 ) 

(iwO) (Ow) (nno) (ww 2 ) (wl) 

(w 2 0) (Our) (wW) (w 2 l) (w 2 w) 

where w is a primitive cube root of unity. 

Clearly G„{q) has q" members. The q" — 1 nonzero elements of G„(q) 
can be divided, in many ways, into (q — 1) sets Gi , • • • , G g -i such 
that Gi = \Gj, A f F*(q). For our purposes it is usually enough to 

t For finite projective geometry, see Carmichael, 4 Ch. 2; for Clalois fields, see 
van der Waerden, 5 Ch. 5, Sect. 37. 
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examine only one of these sets, for example the first line in the table 
above. 

A subspace of G„(q) is called an alphabet over F(q) and its members 
are called letters. The length of a letter is n and the number of nonzero 
coordinates in a letter is its weight. Every alphabet contains the letter 
(00 • • • 0). The minimum weight of its other letters is denoted by d, and 
d is also called the weight of the alphabet. The dimension of the alphabet 
as a vector space over F(q) is k. By &(k,d,n) we mean an alphabet Ct 
with dimension k, weight d and length (of each letter) n. For example, 
G„(q) is Q(n,l,»). 

An alphabet CL(k,d,n) contains q letters, from which we pick any k 
independent vectors as generators. We write these as the rows of a 
k X n matrix M(Qt), the generator matrix of Ct. For example, 

1 1 



1 1 

is the generator matrix of an (2(2,2,3). We may assume that no column 
of a generator matrix consists entirely of zeros, for then the alphabet is 
isomorphic to a subspace of G n -i(q). 

An ordered set of k elements of F(q), not all zero (for example, a 
column of a generator matrix), may be regarded as the coordinates of a 
point of a projective space Tk~i{q), of projective dimension (k — 1), 
over F(q). We shall adopt the convention that a &-tuple which refers to 
a point of Tk~i(q) is to be written as a column vector, e.g., 



Qi = 



Qu 



T k -i(q) contains (q k - \)/{q - 1) points; if X 6 F*(q), Q and XQ are 
the same point. The points of Tk-i(q) are in one-to-one correspondence 
with one-dimensional subspaces through the origin in Gk(q). 
Let us now write the generator matrix of CL(k,d,n): 



Ri 

M(a) = R 2 



Qi Q 2 

?21 522 

?*i qk-> 



q kn 
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and call the rows R\,Ri, ••• , Rk and the columns Qi , Qa , • ■ , Qn • 

Regard the columns as a set of points in T k -i(q). There are exactly k 
independent columns, so this set of points spans the space T k -i(q). Let 
vi be the number of times which some multiple of the column Q, [the 
multiplier being an element of F*(q)] appears in M(G). The correspond- 
ing point in 7Vi(tf) shall then have multiplicity v,- . We can now intro- 
duce Bose's theorem, f 

Theorem 1 : Let 

be a general point of T k -i{q). Let S denote the set of points Qi , Q 2 , • • • , 
Q„ each counted with proper multiplicity. Then the weight of the letter 

R(X) = Xxft, + X 2 R 2 + • • • + X k R k , X, € F(q) 

of a is equal to the number of points of the set S which do not lie on the 
hyperplane 

H(X) = Xj/a + X 2 yo + • • • + XkVk = 

of TWtf). 

Proof: If, for example, 

Xitfn + Xotfoi + ■ • • + X*g*i - 0, 

the point Qi lies on H(X). The zeros in the letter R(X) arise from the 
points of S which lie on H(X), and the number of zeros will be the 
number of such points counted with proper multiplicity. The weight of 
R(X) is the number of its nonzero coordinates, which is the number of 
points of S (again counted with proper multiplicity) not lying on H(X). 
This proves the theorem. 

In Fig. 1, the projective plane 7 , 2 (2) is over the field (0,1). Note that 
Q4Q5Q6 are also collinear: 



1\ /o\ /o\ /O^ 




t A different proof of this theorem for the field (01) is given in Ref . 3. 
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Fig. 1 — Illustration of Theorem 1. 

Taking points Qi ,Q 2 ,Qs ,Qi in Fig- 1 as the set N we obtain a generator 
matrix 




of an alphabet ©(3,2,4). It is clear from the figure that there are at 
least two points of N not on any line of jP 2 (2). 



III. EQUIVALENT ALPHABETS 

In this section we take up the question of equivalent alphabets, and 
show how Slepian's definition of equivalence may be extended lo the 
more general case. First we discuss what properties one would intuitively 
hope for from such a definition. 

We may consider an alphabet as an array of letters arranged one under 
another in such a way that we can speak of its columns. We know that 
the operations of permuting the columns, multiplying any column by an 
element of F*(q), and interchanging the names of the nonzero symbols 
will not change the error-correcting properties of the alphabet. The defi- 
nition of equivalence between alphabets should allow us to do as many 
of these things as possible. 

From Bose's theorem we recall that the weight of every letter of an 
alphabet is determined by the properties of a set of points in Tk-i(q). 
First we wish that all alphabets derived from the same set of points 
should be equivalent; secondly, if two sets of points S,S' have, in some 
sense, the same incidence relations with the hyperplanes of Tk-\{q) they 
should give rise to equivalent alphabets. 
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Given a set of points S in Tk-i(q), we derive an alphabet from them 
by means of a generator matrix. We obtain the generator matrix by the 
following steps: 

1. Fix a coordinate system in Tk-i(q)j 

2. Write the coordinates of the points of S as columns of a matrix 
repeating each column (not necessarily consecutively) with the proper 
multiplicity. 

The order in which we write the columns is immaterial ; also if X, is 
such a column, we have the option of using XX,- , X € F*(q), instead. Thus 
it is apparent that a great many different generator matrices may arise 
from the same set of points. 

We shall presently give separate intrinsic definitions of equivalence 
between two sets of points, two matrices and two alphabets, and show 
how they are interrelated. First we give a brief description of the col- 
lineation group of 7V-i(})4 

A collineation is a mapping of the set of points of T k -i(q) onto itself 
which preserves all incidence properties; that is, it sends lines into lines, 
planes into planes, lines through a point into lines through a point, and 
so on. The collineations of T k -i(q) form a group, denoted by C(k,q). 
A nonsingular linear projective transformation of coordinates is a 
collineation; so is the (nonlinear) transformation of coordinates induced 
by an automorphism of the ground field F(q). Let P{k,q) be the group 
of linear projective transformations, and A(k,q) the group of trans- 
formations induced by automorphisms of the ground Held. Then any 
collineation of C(k,q) can be expressed as the product of a member of 
P(k,q) and a member of A(k,q). [Although an element of P(k,q) does 
not in general commute with an element of A(k,q), the two groups 
commute as subgroups of C{k,q).} We recall that an automorphism of a 
finite held of q = p m elements is always of the form — * 9''', where 6 
is a primitive element; and, for a nontrivial automorphism, < v < m. 
The integers of the field (the elements of the prime subfield) are not 
changed by such a mapping; hence a prime field has no nontrivial auto- 
morphisms, and in this case C(k,p) = P(k,p). 

We now make the following definitions of equivalence: 

Definition 1 : The (unordered) sets of points S,S' are equivalent if 
there exists a collineation of T/_i(r/) which sends S into S'. We write 
S' = C(S). 



t By a fixed coordinate system we mean that the coordinates of every point are 
fixed, except possibly for multiplication by an element of F *(q). In the case of 
finite projective geometries, this involves more than choosing the base points of 
the system. 

{ The subject is treated in great detail in Carmichael, 4 pp. 355-372. 
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Definition 2: Two (A; X n) generator matrices M,M' over F(q) are 
equivalent if 

W = g$M*, M* = MvA. 

Here $ is an automorphism of the ground field applied to the entries in 
M*, g an invertible (/c X k) matrix over F(q), xan (n X n) permutation 
matrix, A a (nonsingular) diagonal (n X n) matrix over F*(q). 

Since x has only one nonzero entry in each row and column we can 
always choose A' so that 

A'x = xA. 

Definition 3: Two alphabets ft and ft' are equivalent if there exists 
between them a weight-preserving semi-isomorphism. 

A semi-isomorphism / between two vector spaces ft, ft' is uniquely 
specified by describing what happens to the base vectors Ri , • ■ ■ , R k 
of ft, and choosing an automorphism of the ground field. The mapping 

f(Ri) = R'i, i = 1, •••,/<;, 

is a semi-isomorphism provided that Ri , ■ ■ ■ , R k are linearly independ- 
ent; any semi-isomorphism can be described in this way. 

We note also that a weight-preserving mapping of an alphabet (X onto 
an alphabet ft' is necessarily one-to-one; for only letters of zero weight 
in CI can map onto the zero (00 • • • 0) of ft'. 

In all of these definitions, equivalence has its usual properties; i.e., it 
is symmetric, reflexive and transitive. 

We now show that the three definitions are compatible; that is, in a 
sense to be made precise, 

Definition 1 — > Definition 2, 

Definition 2 — > Definition 3, 

Definition 3 — * Definition 1. 

Theorem 2: If 8,8' are equivalent in the sense of Definition 1, then the 
matrices M,M', to which they give rise in a fixed coordinate system, are 
equivalent in the sense of Definition 2. 

Proof: Let S be an ordering of the set S, and S' the ordering of *S" 
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into which S is sent by a collineation g$ of T k -i(q). If X t ,Xi are cor- 
responding points of S,S' their coordinates are given by 



X, = 



X\ = g 



*(Zl) 

[*(x„)\ 



Let M(S), M(S') denote the matrices with columns 

X lf ••-.X,,, Xi,.-,^, M(S') =g*M(S). 
Then there exist permutation matrices such that 

MV = M(S') = g*M(S'), M(S) = M-k. 
Hence 

M' = g*M*, M* = Mirir'" 1 = Mir*, 

where ir* is a permutation matrix. 

Theorem 3: If the generator matrices M,M' are equivalent in the sense 
of Definition 2, then the alphabets 6L,Q,' derived from them are equiva- 
lent in the sense of Definition 3. 

Proof: We have 

M' = (&M*, M* = Mir A. 

Let a* be the alphabet derived from M*. We set up a weight-preserving 
isomorphism h between a and a*, and a weight-preserving semi-iso- 
morphism / between a* and Ct'. We define h as follows: If R is a letter 
of CI then 

h{R) = RtA. 

This is clearly a weight-preserving mapping, since its effect is to permute 
the entries in R and multiply each entry by an element of F*{q). It is 
also linear, for if R\ , •• • , Rk are the rows of M, and Rf, ■■■ , R k * the 
rows of M* we have 

h(Ri) = R.rA = Ri*, 
h ( S auRt) = E oaRiTA = £ ctiRi*. 

\V=1 / . = 1 i=l 

We define f as follows: If R* - (n , r 2 , • • • , r») is a letter of a*, then 
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f(R*) = [4>(ri),<J>(r 2 ), ■•■ ,$(r»)];/is weight-preserving, since $(r) — 
implies ;• = 0. 

To show that /is a semi-isomorphism, 

a* -U a', 

we observe that g~ 1 M' is also a generator matrix of ft'. Let Ri , • • • , R k 
be the rows of g~ l M', and let R i = (r« ,r l2 , • ■ ■ , r in ). Let Ri*, • ■ • , R k * 
be the rows of M*, with it!,* = (r fl ,*r l2 ,*■•-, r*,*). Since g~ l M' = 9M* 
we have 

(r'a/a, ■ ■ ■,/•'„) = [*(r <1 *),#(r i2 *), ■ • ■ ,*(*■,■„*)], 
or 

/(«#•) = «'< . 

Then 
/ (g «**) - [• (g «*i*) , $ (g «,r«*) ,••,# (g «*Y.*)]. 

Since 4> is a field automorphism this becomes 

= [Z 4>(a,)*(r (1 *), E *(««)#(r«*) l •, S •(<*)*(rfa*)l 

L'=i ;=i «=i J 

t=i 
We then have 

/?. ^L> ft.* _L> R[ f XatRi -±> Z««R<* -U 2*(a,)# ■ , 

and A/ is a weight-preserving semi-isomorphism between & and Q'. 

Theorem 4'- Let &,&' be equivalent alphabets in the sense of Definition 
3, and M,M' be any generator matrices of &,&'. Fix the coordinate sys- 
tem in Tk-i(q), and let S,S' be the sets of points whose coordinates are 
the columns of M and M'. Then S and <S" are equivalent in the sense of 
Definition 1. 

Lemma: Let the alphabets &,&* be related by a weight-preserving 
isomorphism w; M,M* are generator matrices of a and a* such that 
M* = w(M). Then in any coordinate system in T k -\(q) the columns of 
M and M* give rise to the same (unordered) set of points S. 
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Proof of Lemma: If Ri 
and M* we have 



Rk ; #1*, • • • , Rk* are the rows of M 



Let (7/1 • • • y k ) be the coordinates of the general point of T k -i(q). Map 
the letters of ft onto the hyperplanes of T k -i(q) as follows: R, maps onto 
Vi = 0, 2Z <*<#< maps onto 2 *#< = °- Because of the isomorphism be- 
tween ft and ft* we have a similar mapping of the letters of ft* onto 
the hyperplanes of Tk-i(q): R* maps onto ?/, = 0, £ «■#.* maps 

onto 2 a &i = 0. 

Let / = (5,-y) be the incidence matrix of points and hyperplanes in 
T k -\{q), where 5,> = 1 if the ith point lies on the jth hyperplane and is 
zero otherwise. Each row (column) of I contains (g* _l — \)/{q — 1) 
ones and q h ~ x zeros. The matrix / for the projective plane T 2 (2) is il- 
lustrated in Table I. 

The matrix / is nonsingular. This is easily seen by considering the 
product /•/. In this, all terms on the main diagonal are equal to the 
number of points, a = (q k ~ l - l)/(q - 1), on a hyperplane. All other 
terms are equal to the number of points, b = (q'~ 2 — \)/(q — 1), on 
the intersection of two hyperplanes. The determinant of the matrix is 
then 

[a + („ - l)b](a - 6) B -\ 
When we substitute the values for a,b, the first factor becomes 

hence the determinant is not zero. (We assume k > 1.) 

Let Pi , P2, - • - , P? , M = (q k ~ l)/(q ~ 1), be the ordering of the 
points of T k -i(q) as they appear as columns of /. Let S,S* be the sets 

Table I — / = Incidence Matrix for Points and Lines in Ti(2) 





100 


010 


001 


no 


101 


Oil 


111 


Vi = 





1 


1 








1 





Vi = 


1 





1 





1 








1/3 = 


1 


1 





1 











w + ?/». = 








1 


1 








1 


yi + 2/3 = 





1 








1 





1 


y» + ?/a = 


1 














1 


1 


2/1 + yz + v* = 











1 


1 


1 
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of points whose coordinates are columns of M,M* respectively. Assign 
to Pi the multiplicity rii(ni*) with which it appears in the set S(S*). 
If Pi does not appear in S(S*), n, = (n* = 0). Form the column 
vectors 



n = 






, n* = 



wi 



»2 



The tth term of the matrix product In is the sum of the multiplicities 
of the points of S which lie on the tth hyperplane. By Theorem 1 , this 
is the number of zeros in the corresponding letters of Q. 

Since the isomorphism between & and Q,* is weight-preserving we have 



or, since / is invertible, 



In = In" 



n = n* 



Hence the set of points S* is at most a rearrangement of the set S. 

Proof of Theorem 4 '• & and Ct' are related by a weight-preserving semi- 
isomorphism /. Let Ri , • • • , Rk be the rows of the generator matrix M 
of Q,. R\ = f{R\), • • ■ , Rk = f(Rk) are k linearly independent letters 
of &', which we may take as the rows of a generator matrix M " of Q! . 
We can describe / as follows: 

f(Ri) = R", f ( g afiA = g <ti«)A7, 

where $ is an automorphism of the ground field which is uniquely deter- 
mined by / once we have chosen M. 



Let Rt* = # _I (B< ), i = 1, ■ • • , k; R x 



, R k * are linearly inde- 



pendent. Let M * be the generator matrix formed of these rows and ®* 
the alphabet derived from M*. The mapping h of &' onto d* induced 
by 4> -1 is clearly a weight-preserving semi-isomorphism. 
Consider the mapping fh between G and &*. We have 
/ 



2a,#, 



Ri -i-> RI 
- Z*(a<)ift 



RS, 

* -1 [2*(a,-)flT] =- XotiRi*. 



Since / is weight-preserving by hypothesis, fh is a weight-preserving 
isomorphism between a and Q,*; M and M* are corresponding generator 
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matrices under fh, hence by the Lemma they arise from the same set of 
points £ in T k -i(q). 

Let S" be the points of T k -i(q) corresponding to the columns of M" . If 

M 

;i -lil.V-li 

\4l W 

are the tth columns of M" and M* respectively, we have 

«\ few 

Hence the set S" is obtained from the set S* by a collineation C x of 

T k -i(q). 

Let M' be any generator matrix of a'; then M' = gM" . Let S' be the 
points of Tk-i(q) corresponding to the columns of W . S' arises from S" 
by a linear projective transformation, i.e., by a collineation C 2 . 

We have then 

S' = C 2 (S") = C 2 Ci(S), 

which proves the theorem. 

It can be shown from Theorems 2, 3 and 4 that a complete equivalence 
class of sets of points gives rise to a complete equivalence class of 
matrices; a complete equivalence class of matrices gives rise to a com- 
plete equivalence class of alphabets; and this in turn gives rise to a 
complete equivalence class of sets of points. The details of these cor- 
respondences are quite complicated, since an unordered set of points can 
give rise to many matrices, and different generator matrices can produce 
the same alphabet. 

Theorems 2, 3 and 4 are, of course, true over the field (0,1 ). We rewrite 
our definitions for this field, since they take a simpler form. * is the 
identity, and the only possible choice for A is the unit matrix. 

Definition I': Two sets of points S,S' in T k -i(2) are equivalent if 
they are related by a linear projective transformation of coordinates. 

Definition 2' : Two (k X n) matrices M,M' over F(2) are equivalent if 

M = gM'r, 

where tt is an (n X n) permutation matrix, and g an invertible (k X k) 
matrix over F(2). 
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Definition S'\ Two alphabets d,Qf over F(2) are equivalent if they 
are isomorphic as groups in such a way that corresponding elements 
have the same weight. 

It will be recognized that this is, in fact, the familiar definition of 
equivalence for alphabets over (0,1). 

IV. RELATIONS BETWEEN k, d, n 

In this section we establish certain relations between the parameters 
k,d,n, which are necessary conditions for the existence of an alphabet 
Q(k,d,n). We assume, as before, that the alphabet has no column con- 
sisting entirely of zeros. 

Define Z[x] to mean the least integer greater than or equal to the 
rational number x. 

Theorem 5:f A necessary condition for the existence of Q,(k,d,n) is that 



n > 



•&<£M 



Proof: As before, let / be the incidence matrix of points and hyper- 
planes in Tic-i(q). 

Let J be the complement of / obtained by replacing zeros by ones and 
ones by zeros. ./ is symmetric; each row (column) contains q~ ones and 
1 + q + • • • + q k ~ 2 zeros. 

The matrix J for the projective plane T 2 (2) over the field (0,1) is 
illustrated in Table II. 



Let 



n = 



where jtx = 1 + q + • • • + g* -1 and n< stands for the multiplicity of the 
point Pi of 7W<?). 

Consider the expression Jn. The product of the ith row of J with 
the column of n, is the sum of the multiplicities of the points P,- which 
do not lie on the ith hyperplane. By Bose's theorem, this is the weight 
of the letters of the alphabet corresponding to the ith hyperplane. Now 



t This theorem has been obtained for the field (0,1) by many authors in as many 
ways. See for example, Ref. fi, Theorem 5; Ref. 3, Eq. (52), and other authors 
quoted in Ref. 3. 
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Table II — J = Complement of I 








100 


010 


001 


110 


101 


011 


111 




Vi = 


1 








1 


1 





1 




yt = 





1 





1 





1 


1 




?/3 = 








1 





1 


1 


1 




?/i + v* = o 


1 


1 








1 


1 







|f I + 1/3 = 


1 





1 


1 





1 







?/2 + </3 = 





1 


1 


1 


1 








I/] 


+ yt + y* = 


1 


1 


1 











1 



define a column vector 



d = 



Since our alphabet is assumed to have minimum weight, d, we have the 
inequalities 

Jn ^ d. 

Since we may assume d ^ 1 , these inequalities imply that there must be 
at least one point of nonzero multiplicity not lying on any given hyper- 
plane — that is, the points of nonzero multiplicity span the space 

n_i(<7). 

Hence, given k and d, the least value of ft for which there exists an 
alphabet GL(k,d,n) is the minimum value of 

i=i 

where ft; , i = 1 , • • • , m are nonnegative integers which satisfy Jn ^ d. 
By adding all the inequalities of Jn ^ d, we obtain 



or, setting 



i=i 

n = S n, , 
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In the case that d = q k \ for any value of k the lower bound becomes 



„g|_J=i +9+ 



+ 5 



In this case the lower bound is the largest possible lower bound, as it is 
achieved by the alphabet which corresponds to ?h = n 2 = ■ • ■ = n M = 1 , 
that is, the alphabet which results from taking every point of Tt-i(q) 
with multiplicity one. 

One has an intuitive feeling that alphabets with the least n for a given 
k,d are likely to have no repeated columns if this is possible. This is 
partly justified by the following theorem. 

Theorem 6: If a generator matrix of &(k,d,n) contains a repeated 
column [in the sense that Q t = XQ g for some a of F*(g)], then 



"[^(fe-'H- 



For the purposes of this proof and the succeeding lemma we write the 
above inequality as 



n ^ 



z [M i -fM + >- 



Proof: Let P be the point of Tt-i(q) which corresponds to the re- 
peated column. Choose a coordinate system in which P is one of the 
base points, say P = ei . We then have an equivalent alphabet ft' 
which may be written 

1 1 ••• 



M(a') = 



1 











1 



The letters of Of to which the first two columns contribute zeros form 
a vector space ft; Q is generated by the rows 2, • • • , k of ilZ(ft'). The 
minimum weight of the letters of ft is at least as great as the minimum 
weight of the letters of ft'. Hence the alphabet ft has parameters k — 1, 
d', n — 2, with d' ^ d. By Theorem 3 we get 



or 
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It is clear that, if d ^ 2q l 



Q - 

We need a little more, namely: 
Lemma: If d ^ q~ , then 



i( 1 -pO- + »*«^t( 1 -?)* 



iO-p)^-*^ 1 -?)^ 1 



Hence 



Proof: 






+ ^ , ( - , , ) I '/ - 2 



tO-J) 



i 



--.H-p + s 



iO-J) 



rf + 1. 



g 

'Theorem 7: If d ^ ? t_1 the bound given in Theorem 3 cannot be at- 
tained by an alphabet with repeated columns. 

This result is not surprising in view of the remark at the end of the 
proof of Theorem 3. If d > q~\ the inequality of Theorem 5 gives 
n > (q — \)/{q — 1 ) ; i.e., n is larger than the total number of points 
in the space T k -i(q). Thus we must have repeated columns in the 
generator matrix. 

By repeated applications of the procedure of Theorem 6 we can write 
down lower bounds for the n of alphabets having a given number of 
columns with given multiplicities. However, this does not seem very 
interesting; we will first say what we can about alphabets with no re- 
peated columns. We assume from now on that we are dealing with such 
alphabets. 

V. A CLASS OF ALPHABETS 

In this section we describe a class of alphabets for which the bound of 
Theorem 5 is attained, and show how other alphabets which attain this 
bound may be derived from them. 
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We can immediately write down the class of alphabets.! Choose a 
fixed k, and consider the following sets of points in T k -i{q): 

(0) — The set S of all points of 3P*-i(ff) : 

7i = 1 + q + • • • + q k ~\ d = q k ~\ 

Every letter of this alphabet has weight d . 

(1) — The set Si of all points but one of Tk-i(q) - 

7h = q + q 2 + • • ■ + q k ~\ d, = q k ~ l - 1. 

The (1 -f» q + • • • + q ) hyperplanes through the omitted point 
correspond to letters of weight q , other hyperplanes to letters of 
weight q — 1 . 

(2) — S 2 = all points of T k ^i{q) except for the (1 + q) points of a 
line L\ . 

2 , , A-l j t-I 

n 2 = q + ■ ■ • + q , rf 2 = q — q. 

The (1 + q + • • • + q ~ 8 ) hyperplanes through L\ correspond to letters 
of weight q l , others to letters of weight q k ~ l — q. 

(3) — Si = all points of T k -\(q) except for the (1 + q + q) points 
of a plane P2 . 

3 1 , k-l , k— 1 2 

n-i = q + ■ •• + q , di = q — q . 

The (1 + 5"+ ••• + q ) hyperplanes through jP 2 correspond to letters 
of weight q~\ other hyperplanes to letters of weight q k ~ l — q. 



(k — 1) — Sk-i = all points of Tk-i(q) except for the points of a 
hyperplane 

fc-l 1 A-l fc-2 

n k -i = q , dk-i = q — q . 

The omitted hyperplane corresponds to letters of weight q k ~\ all others 
to letters of weight q — q ~ . 

It is easy to verify that for these alphabets the bound of Theorem 3 is 
attained. Consider 



f For the case of q = 2 some, or all, of these alphabets have been found by other 
authors by different methods. See, for example, Refs. 3 and 6. They are, of course, 
picked up by any systematic search, such as linear programming. (Ja-_i istheReed- 
Muller code for m = n, r = 1. 
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q - 1 V f I 

t* \ q - i J 

Since 5^2, this quantity is less than one; i.e., 

i > - - £ (fEr) * > °- 

It will appear presently that for q = 2 these are the only alphabets 
which attain the bound of Theorem 5. The case q > 2 is more compli- 
cated. 

Suppose that &(k,d,n) is an alphabet (with no repeated columns) for 
which 



Write 



f*- 1 \ 3- 1 / 

[ 



.-*id+^]. 

Let 

Qrf = sf/"' - r, ^ r ^ g* -1 - 1, < 8 ^ Q, (1) 

where >• and s are integers; s cannot be zero since d is positive; s ^ Q 
since d ^ g . Then 



" = 4 rf+s -^] 



= d + s. 



If we remove 77 columns from the generator matrix of (X in such a way 
that the remaining matrix is of rank A- (this is always possible for 
rj ^ n — A), we obtain an alphabet of length n — 77 and minimum 
weight d ^ d — 17. Let us consider the worst case, i.e., d = d — y. 
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Lemma: If 



»-*tM^W 



then 



provided that 



n — 77 = Z 



[^)«->)] 



V<(9-i)-^- 1 (r-t). (2) 



Proof: 



This is equal to d + s — 77 if and only if (Q77 + r)/q~ l < 1 ; i.e., 

.<Jtf- , -r)-(.-i) ? £ T -( t -»5 P £ ri 



or 



1 < (i -D-(ff- D^rrr- 



3* 

Now suppose that d > d — rj, and 77 satisfies (2) : 

, [( I + £) i ] al [( 1 + £) M -' ) ]-"-» 

By Theorem 3 applied to the alphabet Q(k,d,n — 77), we have 

»-* **[(!+£)«]■ 

Hence only equality is possible. 

Theorem 8: If &(k,d,n) attains the bound of Theorem 5 and 
GL(k,d,n — 77) is obtained from it by removing 77 columns from a gen- 
erator matrix of Ct, where 77 satisfies (2) in such a way that the remaining 
matrix is of rank A;, then the new alphabet also attains the bound of 
Theorem 5. 



MULTIPLE-LEVEL ERROR-CORRECTING CODES 301 

We remark that if we select the columns with proper care, it is possible 
to remove more than the number given by (2) and still obtain an alpha- 
bet which attains the bound of Theorem 5. The alphabets G 2 , • ■ ■ , G*_i 
listed at the beginning of this section are examples. 

We now reformulate (2) in a more convenient form. We observe, 
from (1), that, since d is an integer, so is {sq k ~ l — r)/Q. Subtract from 
it the integer 

and we find that 

s — r s — r 



Q g"-- + • • • + q + l 
is also an integer. We have two cases: 

i. 8 = Q, r = nQ [l*£q- lfrom(l)]. 
Then (2) becomes 

*<(g-l)-M+g 

or, since all these symbols represent integers, 

? ^( ff -l)-|u (3) 

ii. s < Q, r = M Q + s L < ( 1 - ,_/_ J (g-1) from (1) . 

Then (2) becomes 

9 < (q - 1) - ix pr- 

or 

V ^q - I ~ H - I = q-2 - n. (4) 

In case i we have, from (1), 

d = f /-' - M> ^ ii g q - 1. 

The alphabet d corresponds to the case n = 0, and the alphabet tti to 
H — 1. From the alphabet G we can subtract any number r; ^ g — 1 
of columns and obtain an alphabet which attains the bound of Theorem 
5. (The alphabet Qi is obtained by subtracting one arbitrary column.) 
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In case ii we have, from (1), 

For the alphabets (£ 2 , ■ ■ ■ , Ctk-i we have m = 0. This is readily verified 
by direct calculation: 

* - <z* _1 - t 1 = (TV - * - i) = (e - D^V" 1 ' -1 + • • • + q + 1). 

Thus 

& = (q k '~ 2 + • • • + (?' _1 ), n = 0. 

We shall show that these are the only alphabets besides GL for which 
M = 0. 

The generator matrix of an alphabet &{k,d,n) with no repeated col- 
umns consists of a subset of the columns of the generator matrix M(GLq). 
Let S be the generating points of d(k,d,n) in Tk-i(q), and denote by 
C(£) the points of Tk-i(q) which are not in S. 

Let v be the number of points in C(S) and 5 the maximum number of 
points of C(S) which do not lie on a hyperplane of Tk-i(q). The alphabet 
& then has length n — v and weight do — 8, where n = (q — 1)/ 
(q — 1), d = q k ~ l are the parameters of G - Using Theorem 5 on these 
numbers, we obtain 

or 

u(q-l) *( ff -^)a. 

Since v(q — 1 ) is an integer we may replace this by 

Kg- 1) ^ ff* - I- (5) 

This is the best we can do, since 8 ^ q . 

By some further manipulation we find that for the alphabet a, gen- 
erated by Cfc — £7(5), to attain the bound of Theorem 5 we must have 

(q8 - 1) - (q - 2) g ,(? " 1) ^ fl* " 1. (6) 

We also wish to have an alphabet with n = 0; for such an alphabet 

d = ^^ = »(fl " 1), 

8 = d - d = fl* -1 - s(q - 1), 
qS- 1 = (q k - 1) - sg(g - 1); 
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i.e., qS — 1 is divisible by (q — 1). From (6), the only possibility is 

v{q - 1) = qS - 1. (7) 

We also observe from (6) that if q = 2 we have v(q — 1) = qS — 1 
without any other considerations. 

To justify our statement that tt 2 , • • ■ , Ofe-i are the only alphabets 
besides a for which m = 0, we prove the following theorem. 

Theorem 9: If C(S) is a set of v points in Tk-i(q), with 8 denned as 
above, and v{q — 1) = qd — 1, then C(S) is the set of all points of a 
linear space in T k -i(q). This, of course, implies that 



= 1 + q + • • • + q', 8 = q°, 



1 < s < k - 2. 



Conversely, if C(S) is the set of all points of a linear space, then the 
alphabet d has n = and attains the bound of Theorem 5. This we have 
already verified. 

Proof: We have v — 8 — (v — \)/q, so that (v — 1) must be a multiple 
of q. If v = 1 the corresponding alphabet is (£i , for which n = 1. 

The proof is by induction on 8; we start by proving the theorem for the 
case (v — l)/q = 1; i.e., 5 = q, v = q + 1. 

Lemma: If v = 1 + q and 8 = q, the (1 + q) points X , Xi , • • • , X g 
are collinear, however large the containing space. 

An equivalent statement, which is the one we prove, is: If every 
hyperplane of Tk-i(q) contains at least one of the points X , Xi , • • • , 
X g , then X ,Xi, ••• ,X Q are the points of a line. 

We may assume that there is one hyperplane, say Yi = 0, which con- 
tains exactly one point X, , which we may call Xi . Pick another point 
for X and let the coordinates of these two points be ei , e 2 . We assume 
the coordinate system normalized so that the first nonzero coordinate of 
every point is unity. Write the coordinates of the X,- as columns of a 
matrix as follows: 

AO A] X 2 X3 * * " Xg 

Yi fl 1 1 ••• 1 x 



I", 



1 a 2 a-i ■■• a q 
b 2 b 3 ••• b q 



U U 



I, 



By Theorem 1, every letter of the form aYi + /3F 2 must contain at least 
one zero coordinate. For a = (fl = 0) we always have a zero in the 
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second (first) place of such a letter. In the (q — 1) letters of the form 
aYi + F 2 , a G F*(q), the zero mustoccur in one of the places 2, 3, • • • , q. 
Hence the a 2 , a 3 , • • ■ , a g above must denote some arrangement of all 
the elements of F*(q). 

Consider now letters of the form 

aYx + pYo + 7 F 3 . 

Again, the first two coordinate places take care of those letters for which 
one of a,/?,7 is zero. Hence we restrict ourselves to letters 

aY, + /3F 2 + F 3 , a,0 € F*(q). 

Each such letter must have a zero in one of the places 2, 3, — , q. 

We note that there are (q — l) 2 such letters, and (q — 1) coordinate 
places. 

Suppose now that 62 ?* 0. We shall count the number of letters to which 
the X 2 column contributes a zero. We may choose any a in F*(q) such 
that a 9± bo . /3( 5^0) is then uniquely determined by the equation 
[in F(q)] 

0a 2 = — (a + 62). 

Hence if b 2 5* the X 2 column contributes a zero to only (q — 2) letters. 
If fo 2 = we have (q — 1) choices for a, and /3 is determined by 

/?a 2 = —a. 

In this case the X 2 column contributes a zero to (q — 1) letters. 

Hence the only possible choice for fe t is &, = for all i. 

The same argument shows that all rows F, , i > 3, consist entirely of 
zeros. The coordinates of X 2 , • • • , X q are linearly dependent on those 
of X ,Xi ; that is, the points X 2 , • • • , X q all lie on the line joining 

Xq,Xi . 

Returning now to the main theorem we make the following induction 
hypothesis : 

Let C(S) be a set of v points in T k -i(q), with 8 defined as before, and 
such that 

(q - \)v = qb - 1. (7) 

Let q r ~ 2 < 8 ^ g r-1 , and assume that Theorem 9 is true for values of 
5 ^ q r ~ 2 . We wish to prove that 

i. 8 = q'\ *=l + g + «?*+■••+ q~\ 

ii. C(S) consists of all points of a linear space of projective dimension 
(r-1). 
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From (7), v — S = (v — l)/q = h, where h is an integer greater than 
1. h = 1 is the case already considered in the Lemma. Also 

5 = v — h = hq + 1 — h. 

An arbitrary space of dimension (k — 3), say D k -a , in Tt-\(q) will 
contain a number a of points of C(S). We wish to find a lower bound 
a for a. 

There are (q + 1) hyperplanes of T k _i{q) which pass through Z> A _ 3 • 
Denote by ft, , ft , • • • , /S a the number of points of C(£), outside of 
A_3 , contained by these hyperplanes. The hyperplanes through D k s 
contain among them all points of T k -i(q), so certainly all of C(S). We 
have then 

Q 

a+ E/3, : = v = hq+ 1, 

(8) 

a + ft : ^ v - 8 = h. 

A lower bound for a is obtained by making all the /3, equal, /3,- = /3, 
and replacing "^" by " = " in (8). Then, 

a + = A, 

a+ (? + l)/8 = %+ l. 

Solving these equations, 

h - 1 

5 = . 

Q 
/3 = /i = -. 

q q 

Let a' be the least integer containing a. We note that a' > 0. 

We may assume that some hyperplane, say H k _ 2 , of T k _i(q) contains 
exactly v — S = h points of C(S). Call this set of points C(S'). Each 
hyperplane of H k -2 is a (k — 3) -dimensional subspace of T k _i(q), and so 
by the previous result it contains at least a' points of C(S'). 

For C(S') we have 

/ = h, 8' £h- a £h- ^—1 . 

q 

Therefore, 

qS' - 1 ^ qh - h + 1 - 1 = (q - l)h, 
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or 

v'(q - 1) ^ qS' - 1. 
Comparing this with (5), only equality is possible; i.e., 

qb' - 1 = (q - l)h. 
This implies that (h — \)/q is an integer, and 

q <2 

C(S') is thus a set of points with 

v = h, b' = - and (q — 1)/ = qb' — 1. 
Q 

Since q r ~ 3 < b/q ^ q r ~ 2 we can apply the induction hypothesis, which 
gives us 

b' = q r ~\ v' = 1 + q + • • • + q~ 2 

or 

5 = qS ' = q r ~\ p = g/ + 1 = 1 + <? + • • • + q r ~ l 

and the points C(«S' ) are all the points of a linear space S r _ 2 in Z/*_ 2 . 
We can always find in ff*_ 2 a (A- — 3) -dimensional subspace, say 
D/c-a , which intersects i3 r _ 2 in a space of dimension (r — 3), and thus 
contains exactly 

ill i >— 3 ' l 1 

1 + q + ••• + q = = « 

points of C(S). 

Consider the hyperplanes of T k -i(q) which pass through Dk-a ■ From 
(8), we have for these 

ft - J - A - —^ , 

so that the total number of points of C(S) in each hyperplane is 

a + = h. 

By the previous argument the intersection of C(S) with each hyper- 
plane is a linear space of dimension (r — 2). These spaces have in com- 
mon a linear space of dimension (r — 3), the intersection of # r _ 2 and 
Dk-a . Hence the set of all their points is a linear space of dimension 
(r — 1). This proves the theorem. 
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We will now summarize the results of the last section. For q = 2, the 
alphabets d , Gi , ■ • * ,G*-i introduced at the beginning of the section 
are the only alphabets which attain the bound of Theorem 5. For q > 2 
these alphabets attain this bound, and have the further property that 
any fc-dimensional alphabet obtained from them by removing up to 
(q — 2) arbitrary columns of the generator matrix (q — 1 for G ) also 
attains this bound. They are the only alphabets with this property. 
Clearly the alphabets a , tti , • • • ,e*_i are completely determined, up 
to equivalence, by the values of the parameters k,d,n. For a given k, 
d and n are restricted to a certain set of values defined at the beginning 
of this section. 
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APPENDIX 

Slepian' s Error-Correction Procedure 

Let F(q) denote a finite field, and G»{q) the group, of order q", of 
all possible rows of n symbols picked from F(q). The group operation 
is place-by -place addition under the rules prevailing in F{q). Let A be 
a subgroup of G„ . [For the present purposes A need not be a vector 
space over F(q); the two concepts are the same if and only if F(q) is 
a prime field.] 

Partition G'„ into cosets with respect to .4, with an element of least 
weight in each coset being picked as "coset leader." The element (00 
• • • 0) is, of course, the coset leader of A itself. The cosets are formed 
into a table as illustrated in Table III. The group A is the first row of 
the coset table. The first column of the table contains the coset leaders. 
In the case of Table III these are, besides (0000), all the elements of 
weight 1 in Gi(3). 

The element in the sth row and the /th column of the coset table is 
obtained by adding the sth coset leader to the element (of A) in the 
first row and the tth column. The sth row is exactly the coset deter- 
mined by the sth coset leader, and every element of G n (q) appears ex- 
actly once in the table. 
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Table III — Cosets with Respect to A for G„(q) = G 3 (3) 





l 


2 


3 


4 


5 


6 


7 


8 


9 


1 


0000 


1011 


0112 


1120 


1202 


2022 


0221 


2210 


2101 


2 


1000 


2011 


1112 


2120 


2202 


0022 


1221 


0210 


0101 


3 


2000 


0011 


2112 


0120 


0202 


1022 


2221 


1210 


1101 


4 


0100 


1111 


0212 


1220 


1002 


2122 


0021 


2010 


2201 


5 


0200 


1211 


0012 


1020 


1102 


2222 


0121 


2110 


2001 


6 


0010 


1021 


0122 


1100 


1212 


2002 


0201 


2220 


2111 


7 


0020 


1001 


0102 


1110 


1222 


2012 


0211 


2200 


2121 


8 


0001 


1012 


0110 


1121 


1200 


2020 


0222 


2211 


2102 


9 


0002 


1010 


0111 


1122 


1201 


2021 


0220 


2212 


2100 



The error-correction procedure is as follows: If the received element 
is a letter of A it is accepted as correct. If not, it is located in the coset 
table, say in row s, column t, and the letter of A in row 1, column t is 
substituted. 

It is clear that the example of Table III will correct all single errors. 
Column 2 contains, besides (1011) which belongs to A, all the elements 
of (?3(3) which differ from (1011) in exactly one place. 

In general, if it is required to correct all single, double, etc., errors it 
is necessary that all elements of G„(q) of weights 1, 2, etc., appear as 
coset leaders in the coset table formed by A. Let d be the minimum 
weight of the letters of A , other than zero. The coset formed by a leader 
of weight 1 will consist of elements of weight at least (d — 1). Hence 
all elements of G„(q) of weight 1 appear as coset leaders if and only if 
d ^ 3. Similarly, all elements of weight 2 appear as coset leaders if 
and only if d g£ 5. If it is required to correct all e-fold errors, the al- 
phabet A must have d ^ 2e -f- 1. 
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